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Abstract 

Let be a probability distribution with support on the non-negative 
integers. Four methods for generating a simple undirected graph 
with (approximate) degree distribution F are described and com¬ 
pared. Two methods are based on the so called configuration model 
with modifications ensuring a simple graph, one method is an exten¬ 
sion of the classical Erdos-Renyi graph where the edge probabilities 
are random variables, and the last method starts with a directed 
random graph which is then modified to a simple undirected graph. 
All methods are shown to give the correct distribution in the limit 
of large graph size, but under different assumptions on the degree 
distribution F and also using different order of operations. 

Keywords: Simple graphs, random graphs, degree distribution, gen¬ 
erating algorithms, configuration model. 
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1 Introduction 

A graph consists of two sets of objects: a set of vertices, V, and a set of 
edges, £. Each edge corresponds to a pair of vertices and the graph is 
said to be undirected if these pairs are unordered so that no directions are 
associated with the edges. Graphs with no duplicate edges and no loops 
- that is, with at most one edge between each pair of vertices and with 
no edges between a vertex and itself - are called simple. Furthermore, a 
graph is referred to as random if some kind of randomness is involved in its 
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construction. In this paper we will consider graphs that are random in that 
the edges are generated by random mechanisms. The question at issue is, 
given a set of vertices and a probability distribution F on the non-negative 
integers, how do we proceed to construct a simple undirected graph where 
the degree of a randomly chosen vertex has distribution FI 

The simplest random graph model is the Erdds-Renyi graph, which was 
introduced in the late 50’s by Paul Erdos and Alfred Renyi [am. In fact 
they introduced two, closely related, models. Given a set of n vertices and 
a number m < ( 2 ), the first model, denoted by Qn,m, is defined as the 
ensemble of graphs having exactly m edges, each possible graph appearing 
with equal probability. The second model, denoted by Gn,p, is obtained by 
independently adding each one of the ( 2 ) possible edges of the graph with 
some probability p. The distribution of the vertex degree is then binomial 
with parameters n — 1 and p and, if p is scaled by 1/n, we get a Poisson 
distribution in the limit as n —>■ 00 . Erdds-Renyi graphs have been widely 
studied and thorough descriptions of the field can be found in I1II2]. 

An area that has received a lot of attention within statistical physics dur¬ 
ing the last few years is the use of graphs as models for various types of 
complex networks; see e.g. [71 [IH] and the references therein. Examples of 
network structures that have been studied are social networks, power grids, 
the structure of the internet and various types of collaboration networks. 
Typically, this type of networks are very large, making it impossible to 
describe them in detail. A natural approach then is to let the edges repre¬ 
senting the connections in the network be generated by a random procedure 
designed so that the resulting graph captures the features of the real-life 
network in question as well as possible. Since, as mentioned, the networks 
are usually large, it is particularly urgent that the asymptotic properties 
of the graph model agree with empirical observations. 

An essential characteristic of a graph is the vertex degree and, in a random 
graph, this is a random quantity. For instance, as mentioned above, in the 
Gn,p model by Erdos and Renyi, the degree of a vertex is asymptotically 
Poisson distributed. The Erdds-Renyi graphs have a very simple and ap¬ 
pealing mathematical structure and a lot of work has been done on the 
model. However, empirical studies have shown that the degree distribution 
in many real-life networks differs significantly from a Poisson distribution; 
see e.g. [TS] (human sexual relationships), [T^ (physical structure of the 
internet) and [T] (movie actor collaboration network). Complex networks 
typically have a more heavy-tailed degree distribution, often specified by 
some kind of power law, meaning that the number of vertices with degree 
k is proportional to k~'^ for some exponent r > 1. This type of graphs is 
often referred to as scale-free graphs and there are important features of 
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such graphs that are missed out if they are approximated by Erdos-Renyi 
graphs; see e.g. [1]. 

In view of the above, it is important to be able to generate random graphs 
with other degree distributions than Poisson. The aim of this paper is 
to contribute at this point by describing a number of algorithms that, 
given a probability distribution F (which will later be subject to various 
restrictions), produces simple undirected graphs whose vertex degree is 
asymptotically distributed according to F (here, clearly it is required that 
Supp(E') C N, where Supp(F) denotes the support of F). To be more 
precise, given a set of n vertices and a random mechanism to generate 
edges between them, let p). denote the probability of a randomly chosen 
vertex having degree k and write F = {pk',k > 0}. Our task is then to 
design an edge mechanism such that 

(i) lim^^oopi,"^ = Pk\ 

(ii) the resulting graph is simple and undirected. 

In all applications mentioned above, the networks are simple and undi¬ 
rected. Other applications might involve more complex networks, for in¬ 
stance the link structure of the world-wide web constitutes a directed graph 
and bipartite graphs - that is, graphs with two types of vertices and edges 
running only between unlike types - are common within sociology. How¬ 
ever, simple undirected networks is indeed an important class in applica¬ 
tions. The present work provides a rigorous treatment of the asymptotic 
behavior of the vertex degree in a number of possible methods for generat¬ 
ing such graphs. Some of the methods/results are new, but an important 
point of the paper is also to survey and gather previously known material. 
To our knowledge, this type of comparative study with focus on the vertex 
degree does not exist previously in the literature. 

We mention also that there exist algorithms for generating simple graphs 
with a given degree sequence di,... ,(i„; here, di is a non-negative integer 
specifying the degree of vertex i. For example, in McKay and Wormald m 
an algorithm is described which, under certain conditions, produces a uni¬ 
formly selected simple undirected graph with the exact prescribed degree 
sequence. This is of course even better than the algorithms studied in the 
present paper which only has the correct degree distribution in the limit as 
the number of vertices tends to infinity. However, the price one has to pay 
for this is a more complicated algorithm and more restrictive assumptions 
about the degree distribution (the McKay and Wormald criterion corre¬ 
sponds to the degree distribution having moments of order 4 -|- e). In the 
sequel we are hence not interested in obtaining a specific degree sequence. 
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but only in proving that the distribution converges to the intended one in 
the limit. 

The rest of the paper is organized as follows. In Section 2 we review the 
well-known configuration model and describe how it can be used to generate 
simple graphs with an arbitrary prescribed degree distribution. Section 
3 treats a model inspired by Chung and Lu that generates simple 

graphs with mixed Poisson degree distributions. In Section 4 we propose a 
method that is based on the introduction of directed edges according to a 
suitably chosen distribution. This method produces graphs with a degree 
distribution whose generating function contains a Poisson factor. Finally, 
in Section 5 the methods are discussed and evaluated. 


2 The configuration model 

The configuration model was defined independently in [3] and m. both 
papers being inspired by [2] . The model has later been analyzed in [MllITl 
[IS] for instance. Given a probability distribution F, the model describes 
a way to construct an undirected graph on n vertices, labelled vi,... ,Vn, 
having degree distribution F. It is defined as follows. For each vertex 
Vi, generate a degree di independently from a random variable D with 
distribution F and attach di “stubs” to Vi. Then join the stubs of all 
vertices pairwise completely at random to form edges between the vertices. 
To be more precise, first pick two stubs randomly among all stubs in the 
graph and join them. Then pick two stubs at random from the remaining 
di — 2 stubs and join them, etc. 

A few problems might occur in the construction of a graph according to 
this algorithm. The first obvious problem is if the sum of all degrees, 
di, happens to be an odd number. In this case there will always be one 
remaining stub left over in the pairing algorithm. However, unless n is odd 
and F is concentrated to the odd numbers, this problem is easily solved 
by either regenerating the degrees until their sum is even or removing one 
stub chosen at random. More serious problems arise when the aim is to 
generate a simple undirected graph, that is, a graph without loops and 
multiple edges. In the configuration model, it is clearly possible for a stub 
of a given vertex Vi with > 2 to be matched with another one of the 
stubs of Vi, resulting in an edge from vertex Vi to itself, that is, a loop. 
Similarly, two stubs of Vi could by chance be joined with two stubs of the 
same other vertex, with the effect that a multiple edge is created. 

So what should we do if we insist on the resulting graph being simple? 
Two obvious suggestions are (1): to remove loops and merge multiple edges 
into single edges in the generated graph to obtain a simple graph as final 
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product, or (2): to redo the algorithm until a simple graph occurs by 
chance. These methods will be referred to as “Erased configuration model” 
and “Repeated configuration model” respectively. Both methods make the 
degree distribution somewhat different from the intended one, but, as we 
will see, both of them have the right degree distribution asymptotically 
under certain moment conditions on the degree distribution. 

2.1 The erased configuration model 

Let Fn = > 0} denote the degree distribution in the erased config¬ 

uration model with stub distribution F = {pj]j > 0}, that is, is the 
probability that a randomly selected vertex has degree j in the erased con¬ 
figuration model on n vertices. Also, write for the number of vertices 
having degree j in the resulting graph. 

Theorem 2.1 If E has finite mean, then 

(a) Fn ^ F as n —>■ oo; 

(b) /n —>■ pj in probability, that is, the empirical distribution con¬ 
verges in probability to F. 

The proof of this theorem is a bit lengthy, although not hard. It is found 
in the Appendix. 

Theorem l2.1l is not true without the assumption that the degree distribution 
F has finite mean, at least not when the tail decays like 1 — F{x) ^ cx~°‘ 
where a < 1. A heuristic argument for this goes as follows. For such a decay 
of the tail, it is known that X]r=i ma‘X{i<i<ra} Di are of the same 

order, see e.g. [11, Ch 13.11]. As a consequence, the probability that a stub 
is connected to a stub of the maximal node is non-negligible. It follows that 
any node with original degree 2 or more has positive probability (bounded 
away from 0) to have more than one stub connected to the maximal node. 
But whenever this happens, the degree of the node is decreased in the 
erased configuration model and it follows that the new degree distribution 
will converge to a distribution stochastically smaller than F. 

In view of the above, there is no hope that the degree distribution will stay 
unaffected by the erasing procedure when the mean is infinite. However, 
if the degrees are conditioned to be smaller than for some a £ (0,1) it 
turns out that Theorem 12.11 remains valid. Indeed, in many applications it 
is artificial to include vertices with degree larger than for some a £ (0,1), 
and hence it is sometimes natural with this type of conditioned degrees. 


5 


Write Fn^a for the degree distribution in the erased configuration model on 
n vertices where the number of stubs of vertex Vi has distribution 

= j) = ^ for j = 0,..., n“, 

with Di ^ F. Also, let be the number of vertices with degree j in 

the graph after loops and multiple edges have been erased. 

Theorem 2.2 For any a G (0,1), we have 

(a) Fn^a F as n ^ oo; 

(b) —>■ pj in probability. 

Except for a few minor modifications, the proof of this theorem is analo¬ 
gous to the proof of Theorem 12.11 The modifications are described in the 
Appendix. 


2.2 The repeated configuration model 

The repeated configuration model consists of performing the configuration 
model until it produces a simple graph. As pointed out in m, it follows 
from results in |13j that the probability of obtaining a simple graph in 
the configuration model converges to a strictly positive constant c if the 
degree distribution has finite second moment. This implies that a simple 
graph is then obtained after a geometrically distributed number of tries. 
Of course, such a graph might not be typical for the configuration model. 
In particular one might suspect that the number of edges is somewhat 
smaller than normal, since there by chance were no multiple edges or loops. 
However, below we use the result from |16j to show that the resulting degree 
distribution converges to the right one provided that it has finite second 
moment. In fact, we show the stronger result that the empirical degree 
distribution converges to the intended distribution. 

Let Fn = ; j > 0} be the degree distribution of the repeated configura¬ 

tion model on n vertices with stub distribution F = {pj\j > 0} and write 

(n) 

iVj for the number of vertices having degree j in the resulting graph. 

Proposition 2.1 Assume that F has finite second moment. Then 

(a) Fn ^ F as n —>■ oo; 

(b) /n ^ pj in probability. 
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Proof: We first show part (b). Let Di,..., Dn be i.i.d. random variables 
with distribution F and let = {{Di'Di = j,i = 1,..., n}|/n denote the 
empirical distribution of these n variables; here | • | denotes set cardinality. 
Also, write Sn for the event that the configuration model on n vertices 
produces a simple graph. The empirical distribution of the repeated con¬ 
figuration model is the same as the distribution of the vector with elements 
conditioned on Sn and we hence have to show that 


— Pj\ > e I Sn) —0 as n —>■ oo for any e > 0 and any j. (1) 
Trivially, we have 


^ - Pji > e i Sn'j = — 

P 

< — 

The numerator here converges to 0 by the law of large numbers and, by 
the cited result of m, the assumption that F has finite second moment 
implies that P(5'„) —>■ c > 0. Hence ([T]) follows. 

To show (a), note that, since 0 < /n < 1, by dominated convergence, 
the result in (b) implies that E[ivj"^/n] —>■ pj. But = 

fnl 

np) , and the desired result follows. □ 

3 The generalized random graph 

In an Erdos-Renyi graph on n vertices, the edges are defined by independent 
Bernoulli random variables with P{Xij = 1) = p, the event Xij = 

1 signifying the presence of an undirected edge between Vi and Vj. By 
definition, Xji = Xij for i < j and Xu = 0 for all i. In this section, we 
consider a model where the probability pij of an edge between two vertices 
Vi and Vj is allowed to depend on i and j. Special cases of this have been 
considered in [SllillD]. We will show that, if the probabilities {pij} are 
picked randomly in a suitable way, we get a graph with a degree distribution 
that is easy to characterize in the limit when n —>■ oo; see Theorem 13.11 
Also, the degrees of the vertices are approximatively independent. The 
model will be referred to as the generalized random graph. 


P{Sn) 

-Pj\ > e) 
P(Sn) ■ 
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First, we develop the model in more detail. To this end, let X = {Xijji^j 
be the array of edge indicators and write P{Xij = 1) = pij = 1 — qij. Since 
the indicators are independent, the probability density of X is given by 

i<j 

Introducing the odds ratios = Pij/qij and noting that pij = Vij/{1 + rij) 
and qij = 1 / {1 + Tij), this can be written 

P{X = x)=W{l + nj)-^\{rl;\ 

i<j i<j 

Moreover, if we specialize to the situation where = UiUj for some pa¬ 
rameters u = {ui}^^i with Ui > 0 and define G{u) := + UiUj), we 

get 


Pu(X = x) = G-\u)l[{u,Ujr^ 

i<j 

= G-\u)l[uf^^\ ( 2 ) 

i 

where di{x) is the degree of the vertex Vi in the configuration a;, that is, 
di{x) := J2j This is a “canonical” distribution in the sense of statistical 
mechanics with sufficient statistics {(ii(X)} and from ([2) we see that the 
conditional distribution of X given that {di{X) = di} is uniform, that is, 
all graphs with a given degree sequence {di} have the same probability. 
This is indeed a nice property of the model, motivating the use of the 
parametrization Vij = UiUj instead of the one defined by pij = UiUj used 

in [SI IS]. 

To obtain a formula for the joint generating function of the degree vector 
{di(X)}, note that, by (Hj), we have 


E, 


[n 


di(X) 


I 


J2PuiX = x)l[tf^^^ 

X i 


Since J^x^^iiX = x) = 1, it follows from ([2]) that = G{u), 

and hence we get 



G ^{u)G{tu) 


i 

n l + UujtjUj 

l + u,u, ■ ^ ^ 

i<j 

Now consider the situation where the parameters {ui} are suitably scaled 
random variables, more precisely, we set Ui = Wijy/n, where {Wi} are 
i.i.d. random variables with finite mean and Wi > 0. Write {Di\ for 
the degrees of the vertices in this setting, that is, Di = di{X) = di{X{W)). 
The following theorem specifies the liming distribution of the Di'.s. 


Theorem 3.1 Consider a generalized random graph on n vertices with 
edge probabilities defined by Pij/qij = WiWjjn, where {Wi} are i.i.d. 
random variables with mean fiw and finite moment of order 1 + £ for some 
£ > 0. We have: 


(a) The limiting distribution of a degree variable Dk as n —>■ oo is mixed 
Poisson with parameter WkPw 

(b) For any m, the variables Hi,... Dm are asymptotically independent. 

Proof: By taking tk = t, where 0 < t < 1, and ti = 1 for i ^ k va. ©, it 
follows that 

1 + WiWktIn 


= E 


n 

'* i^k 


1 + WiWk/n _ 


Using the Taylor expansion log(l + x) = x + 0{x'^), we see that 


-p-r 1 + WiWkt/n 


where = 0{W^ '^i l"^)- To estimate i?„, note that Wf < maxi{W;}lU. 

The law of large numbers implies that Wijn —>■ pw and, since the Wi'.s 

have finite 1 + £-moment, we have that maxi<i<„{Wi}/n —0. It follows 
that Rn converges almost surely to 0 as n —> oo. Hence 

^ as n ^ oo, 

and part (a) follows. To establish (b), note that, by taking U = 1 for f > m 
in (Is]) and proceeding as in proving (a), it can be seen that 



'-i=l 


i=l 


as n —>■ oo. 
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Hence the joint generating function of {Di,..., Dm) asymptotically fac¬ 
torizes into a product of mixed Poisson generating functions, as desired. 
□ 

Now recall that our task is to generate a simple random graph with a given 
degree distribution F. According to the above theorem, if F is mixed 
Poisson with parameter distribution Q with finite moment of order 1 -I- e, 
then this can be done by using the generalized random graph model with 
i.i.d. weights {Wi} distributed according to Q !where [Xq denotes the 
mean of Q. As mentioned in the introduction, the degree distribution in 
many real-life networks is heavy-tailed, the probability of a vertex having 
degree k being proportional to k~'^ for some exponent r > 1. It is not 
hard to see that heavy-tailed mixed Poisson distributions with this type 
of power law behavior can be accomplished by choosing a heavy-tailed 
parameter distribution with the desired exponent. 

In this context it is clearly of interest to know to what extent Theorem 3.1 
is still true if the distribution of {Wi} has a heavy tail such that A[ITi] = 
oo. Indeed, according to the theorem below, if we assume that the tail of 
the distribution varies regularly in the sense that P{Wi > w) ~ cw~°‘ as 
w —>■ oo, for some a € (0,1) and some constant c > 0, then the distribution 
of the Dk'.s is still mixed Poisson, but with a different scaling and different 
mixing distribution. Note that a power law distribution with exponent 
T G (1,2) satisfies the tail condition with a = t — 1 . 

Theorem 3.2 Suppose that {Wi} are i.i.d. with P{Wi > w) ~ cw~°‘, for 
some a G (0,1) and c > 0, and consider the generalized random graph with 
PijlQij = WiWjliT}/^^. Then: 

(a) The limiting distribution of a degree variable Dk is mixed Poisson 
with parameter ylT^ where 7 = c / q °°(1 + x) '^x °dx. 

(b) For any m, the variables Hi,..., Dm are asymptotically independent. 

Remark As pointed out above, the tail behavior of a mixed Poisson distri¬ 
bution with a power law parameter is determined by the parameter. Hence, 
for a mixed Poisson variable Y with parameter ■jWj^, we have that 


P{Y > y) 


Piw^ > y) 

P{Wk > 


that is, the distribution is a power law with exponent r = 2. 
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Proof of Theorem I3.2t As in Theorem 13.11 we have 


E = E 


n 

i^k 


1 + W^Wkt|n^/‘^ 

l + W^Wklny°‘ 


Let us first fix the value of Wk, say Wk = w. Then 


$„H ■.= E[t^'‘\Wk 


w] = E 


i^k 


ni/a 


( 4 ) 


with (p(x) := (1 +a;'u;t)/(l +xw). Write V(x) for the distribution function 
of Wi- Since all {Wi} are independent, we have 


nOO \ n — 1 

1 + y {ipiyx) — \)V{n^^°‘dx)\ 

poo 

1+ / (fi'{x)(^l — V{n^^°'x))dx 

Jo 


where the last equality follows from partial integration. In order to see that 
the last integral is 0(l/n), note that 


n J If'{x)(^l — V{n^^°‘x))dx = J ^ (n^^°‘x)°‘(l — V(n^‘^°‘x))dx. 

By the assumption, (1 — V(y)) is a bounded function which converges to 
c > 0 as y —>■ oo, and hence, by bounded convergence. 





n-^oo Jq 


X = 


ip'{x) 


dx 


= {t- l)u;“7. 


where 7 := c (l+x)~^x~°‘dx. It follows that lim„_>oo 4>„(w) = 
which is recognized as the generating function of a Poisson distribution with 
mean ^w°‘. Integrating over w = Wk, we see that the limit distribution of 
Dk is mixed Poisson with mean 7lP^ and generating function 


E = E 




f 

JO 




'V{dx). 
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The proof of (b) is analogous to the proof of Theorem 13.11 and is therefore 
omitted. □ 

We hnish this section by showing that the empirical degree distribution in 
the generalized random graph converges to the asymptotic mixed Poisson 
degree distribution in the graph. To this end, when {Wi} have finite mean 
fj,^, write for the number of vertices having degree k in the generalized 
random graph with edge probabilities defined by Pijlqij = WiWj/n, and 
let F = {pki fc > 0} be a mixed Poisson distribution with parameter Wpw. 
Similarly, when P{Wi > w) cw~°‘ for some a G (0, 1 ), the number of ver¬ 
tices having degree k for edge probabilities defined by Pijjqij = WiWj 
is denoted by and we write = {p^; fc > 0} for a mixed Poisson 

distribution with parameter ylT", where 7 is defined in Theorem 13.21 

Proposition 3.1 As n —>■ 00 in the generalized random graph, we have: 

(a) If {Wi} have finite moment of order 1 -I- e, then /n —>■ pk in 
probability for all k. 

(b) If P{Wi > w) ^ cw~°‘ for some a G (0,1), then jn —>• in 

probability for all k. 

Proof: The proofs of part (a) and part (b) are analogous and we give 
here the proof of (a). Write for the probability law of the generalized 
random graph on n vertices and let Ipj denote the indicator function. 

Clearly = Yll=i l{Di=fe} and hence, using symmetry, it follows that 


n 

= nP^^\Dx=k). 

By Theorem 13.II fab we have P^^^Di = k) Pk as n —>■ 00 , meaning that 
/n] —>■ Pk- The desired result is now obtained from Chebyshev’s 
inequality if we can show that Var(7V^"^/n) —>■ 0. To do this, note that 


E 


«'Y 


i i^j 

nP(”)(Pi = fc) -hn(n- I)p(”)(Pi = fc, P 2 = fc), 


where the last equality follows from symmetry. By Theorem 13.11 (b), the 
variables Di and D 2 are asymptotically independent and hence we have 
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p(")(Di = k,D2 = k) p\ as n —>■ oo. Using the formula Var(Jf) = 
E[X^] — E[X]^, it follows that Var(A^^”^/n) —>• 0 as n —>■ oo and we are 


done. 


□ 


4 Directed graphs with removed directions 

In this section we propose a construction method where directed edges are 
introduced according to some distribution G. The directions of the edges 
are then disregarded and multiple edges are fused together, producing a 
graph whose asymptotic degree distribution F is the convolution of the 
distribution G and a Poisson distribution with parameter /Tg, where pc 
denotes the mean of G; see ProDOsition l4.ll 

To describe the construction, write V = {ui,..., Un} for the vertex set. Let 
G be a probability distribution with finite mean pc and Supp(G) C N, and 
write {gk} for the probabilities associated with G. Also, define Gn via the 
probabilities 



for fc = 0,1,..., n — 2; 
for k = n — 1] 
for k > n, 


that is, Gn is a truncated version of G with support on {0,l,...,n — 1}. 
The graph is constructed as follows: 

1. Associate independently to each vertex Vi a random variable Yi with 
distribution G„, and add to the graph Yi directed edges pointing out 
from Vi. The vertices to be hit by the edges starting at Vi are chosen 
randomly without replacement from V \ {vi}, independently for all 
vertices. This defines a directed random graph Gdirin, G) = {V, £dir}- 

2. To obtain a simple undirected graph C/(n, G) = {V,£}, the directions 
of the edges are disregarded and multiple edges are fused together, 
that is, an undirected edge between the vertices Vi and Vj is included 
in £ as soon as at least one of the directed edges {vi,Vj) and {vj, Vi) 
is present in £dir- 

Let Di denote the degree of the vertex vt in the resulting undirected graph. 
To find an expression for Di, write for the set of vertices that are hit 
by edges pointing out from Vi in £dir and write V™ for the set of vertices 
that sends outgoing edges to Vi in £dir- Define Zi = |V“ fl (here, 

-> denotes set complement) so that, in words, Zi indicates the number of 
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edges in £dir pointing at Vi and starting at vertices that are not hit by 
outgoing edges from Vi. Some thought reveals that 

Di = 

Clearly all variables {Zi} have the same distribution, which we denote 
by Also remember that Yi ~ Gn for all i. Hence the degree 

variables {Di} are identically distributed and we write F(^n,G) for their 
distribution. The following theorem is the aforementioned characterization 
of the asymptotic degree distribution as the convolution of G and a Poisson 
distribution with the same mean as G. 


Theorem 4.1 As n — >■ oo, we have 

(a) Gn —>■ G; 

(b) H(^n,G) Po(/iG), where Po(/rG) is a Poisson distribution with mean 

(c) F(^n,G) -t G*Po(^g)- 

Proof: The claim in (a) is immediate from the definition of Gn- To prove 
(b), fix a vertex Vk and, for i ^ k, let Xik be a 0-1 variable, indicating 
whether there is a directed edge from Vi to Vk in £dir or not. Since there 
are Yi outgoing edges from Vi and the vertices to be hit by these edges are 
chosen randomly without replacement from the n — 1 vertices in V\{vi}, 
we have P{Xik = 1) = Yij{n — 1). Also, we have that 


Zk — ^ ^ Xik- 

Hence, by conditioning on {Yi}, we obtain 


E = E 




and, since {Yi} are i.i.d. with distribution G„ and |“'V^“*\{ufe}| = n—1—Yfc, 
it follows that 


E [t 


Zk] _ 


n — 1 


E 


n — 1 


n-l+ - 1) 


Yk 


Here denotes the mean of the distribution G„. As n —>■ oo, the first 
factor on the right hand side converges to which is recognized as 
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the moment generating function of a Poisson variable with parameter /ic, 
and the left hand side converges to 1. Hence part (b) is established. 

To prove (c), note that E = E [t^'^E [t^''|yfe]]. The inner expec¬ 

tation is calculated as above and we get 





MGn 
n — 1 



n —1 


E 



t{n — 1) 

1 + MG„(i - 1) 


Yk 


As n —>■ oo, the second factor converges to the generating function of G and, 
as pointed out above, the limit of the first factor is the moment generating 
function of a Poisson distribution with mean fiQ- 

Having proved that the asymptotic distribution of the vertex degree in 
C/(n, G) is G*Po{pg), the obvious question is which distributions can arise 
in this way. We will not give a full answer to this question - it is presumably 
difficult - but rather give a few examples of distributions that can indeed be 
obtained and also specify how the distribution G of the number of outgoing 
arrows at the vertices should be chosen in these cases. 

Power law distribution 

Eirst note that, if C? is a power law distribution with exponent r, then 
G * Po(p-g) will be so as well. Hence, if we are not interested in the exact 
form of the resulting degree distribution, but only in that its tail decays as 
a certain power law, then the C/(n, G)-model is clearly applicable. 

Poisson distribution 

The simplest case when the exact form of the resulting degree distribution 
is important is the Poisson distribution. Clearly, a Poisson distribution F 
with parameter fip is accomplished by choosing G to be a Poisson distri¬ 
bution with parameter 

Mixed Poisson distribution 

A mixed Poisson distribution can be obtained as a limiting degree distribu¬ 
tion in t/(n, G) given a certain condition ([5]) on the law of the parameter. 
To see this, assume that F is mixed Poisson with parameter law Q with 
finite mean (Iq. The moment generating function of F then equals 

pOO 

Jo 

Since it should hold that F = G*Po{nc), we have ipc{t) = 
and = ^.q/2, and hence 

poo 

^0 
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To ensure that this is the generating function of a probability distribution, 
let = inf Supp((5) and assume that 

Cq - /ic3/2 > 0. (5) 

Then 

poo ^ 

'ipcit) = / 

Jo 

where Q{y) = Q{y + A^q/ 2), that is, Q is the distribution Q translated 
y,Q/2 units to the left. This means that G is mixed Poisson with param¬ 
eter distribution Q. Hence, a mixed Poisson distribution with parameter 
distribution Q that satisfies ^q — > 0, is obtained as a limiting degree 

distribution in t/(n, G) by choosing G to be mixed Poisson with parameter 
distribution Q. 

Compound Poisson distribution 

Let F be compound Poisson with Poisson parameter A and discrete sum¬ 
mand distribution R with finite mean yn and generating function ip nit) 
(recall that a compound Poisson distribution is the law of a sum of a Pois¬ 
son number of i.i.d. random variables). Then fip = Xfj,p and ippit) = 
gA(bii(i)-i)^ and, if we want the limiting degree distribution in Q(n,G) to 
be F ^ then we must have 


ipoit) = 

Here, since /Tg = A^k/ 2 and ipR{t) = where {r^} denotes the 

probabilities associated with i?, the exponent in ([S]) can be written as 


A 



)<+i: 





Assume that 


ri > AiB/2, 

and introduce a new distribution R' by defining 


r 


! _ 

k 


fo + Mii/2 for fc = 0; 
ri — /iji/2 for fc = 1; 
fk for k > 2, 


(7) 


16 


that is, R' is obtained by transferring the mass ijlr/2 from the point 1 to 
the point 0 in the distribution R (note that a consequence of ([7]) is that R 
must be chosen so that < 2). With R' defined in this way we have 

that is, G is a compound Poisson distribution with Poisson parameter A 
and summand distribution R'. Hence a compound Poisson distribution 
F with summand distribution R that satisfies © is obtained as limiting 
degree distribution in C/(n, G) by choosing G to be a compound Poisson 
distribution with summands distributed according to R'. 


5 Concluding comments 

In the present paper, four different ways of generating simple undirected 
graphs with a prescribed degree distribution are described. The methods 
are referred to as the erased configuration model, the repeated configu¬ 
ration model, the generalized random graph and the directed graph with 
removed directions (DGRD) respectively. None of the methods is able to 
produce a graph that has the desired distribution exactly - that is, in a 
finite graph, a randomly selected vertex will not have exactly the correct 
degree distribution - but under certain regularity assumptions, it is shown 
that all four methods give the right distribution in the limit as the number 
of vertices n tends to infinity. 

Let us summarize the assumptions on the degree distribution for the dif¬ 
ferent methods: In order for the repeated configuration model to produce 
a simple graph in stochastically bounded time as n —>■ oo, the second mo¬ 
ment of the degree distribution has to be finite and for the generalized 
random graph model with edge probabilities scaled by n to be applicable, 
finite moment of order 1 -I- e for some £ > 0 is required. For the other 
two methods, finite mean is sufficient. If the degrees are conditioned on 
being smaller than n“ for some a G (0,1), the erased configuration model 
can handle distributions with infinite mean as well, and, with a different 
scaling of the edge probability, also the generalized random graph can be 
applied to infinite mean distributions. 

As for the class of achievable distributions, the erased configuration model 
and the repeated configuration model are both able to generate graphs with 
any limiting distribution. The generalized random graph model can only 
produce mixed Poisson distributions and the DGRD-model gives distribu¬ 
tions that can be expressed as the convolution of a discrete distribution with 
finite mean and a Poisson distribution with the same mean - a class con¬ 
taining certain types of mixed Poisson and compound Poisson distributions 
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for instance. However, if only tail properties of the desired distribution are 
specified, both the generalized random graph model and the DGRD-model 
can do the job. 

Concerning the number of operations needed to produce the graph, it is 
easily seen to be of order n for all methods except the generalized random 
graph, which requires 0{n^) operations. Note however that an approxima¬ 
tion of the generalized random graph model that uses only 0{n) operations 
can be obtained by replacing the conditional degree distribution of a given 
vertex, conditional on the Wi:s, with a Poisson distribution with the same 
mean. Among the methods using 0(n) operations the erased configuration 
model and the DGRD-model require less operations: The repeated con- 
hguration model generates a graph a geometrically distributed number of 
times whereas the other two methods only generates a graph once and then 
erases a few edges. 

A perhaps more subjective opinion is that the generalized random graph 
model is probabilistically more tractable than the other methods. Its con¬ 
struction is straightforward, containing less dependence structures, imply¬ 
ing that it is easier to show property results for this model. Also, it was 
easily seen from the construction of the general random graph that the 
obtained graph is uniform in the sense that all graphs with a given degree 
sequence have the same probability. 

Finally we mention that, apart from the degree distribution, there is of 
course a number of other properties of a graph of which it would also 
be desirable to have control over, for instance clustering and path length. 
Rigorous analyzes of algorithms incorporating these aspects is to a large 
extent lacking and it is a future task to further investigate and develop 
methods for generating random graphs specifying such properties. 

Acknowledgement We thank Remco van der Hofstad for giving the idea 
of how to strengthen Theorem 12.II to cover also the empirical degree distri¬ 
bution, and for suggesting Theorem l2.2l Maria Deijfen also thanks van der 
Hofstad for interesting and enlightening discussions on the topic of random 
graphs. 


References 

[1] Barabasi, A.-L. and Albert, R. (1999): Emergence of scaling in random 
networks, Science 286, 509-512. 

[2] Bender, E. and Canfield, E. (1978): The asymptotic number of labelled 
graphs with given degree sequences, J. Comb. Th. A 24, 296-307. 


18 


[3] Bollobas, B. (1980): A probabilistic proof of an asymptotic formula for the 
number of labelled regular graphs. European J. Combin., 1, 311-316. 

[4] Bollobas, B. (2001): Random Graphs, Academic Press, 2nd ed. 

[5] Chung, F. and Lu, L. (2002:1): Connected components in random graphs 
with given degrees seqnences, Ann. Comb. 6, 125-145. 

[6] Chung, F. and Lu, L. (2002:2): The average distances in random graphs 
with given expected degrees, Proc. Natl. Acad. Sci. 99, 15879-15882. 

[7] Dorogovtsev, S. and Mendes, J. (2003): Evolution of Networks, from Bio¬ 
logical Nets to the Internet and WWW, Oxford University Press. 

[8] Erdos, P. and Renyi, A. (1959): On random graphs, Publ. Math. 6, 290-297. 

[9] Erdos, P. and Renyi, A. (1960): On the evolntion of random graphs, Publ. 
Math. Inst. Hung. Acad. Sci. 5, 17-61. 

[10] Faloutsos, M., Faloutsos P. and Faloutsos, C. (1999): On power-law rela¬ 
tionships of the internet topology. Comp. Comm. Rev. 29, 251-262. 

[11] Feller, W (1966): An introduction to probability theory and its application, 
vol II, John Wiley. 

[12] Janson, S., Lnczak, T. and Rucinski, A. (1999): Random Graphs, John 
Wiley. 

[13] McKay, B. (1985): Asymptotics for symmetric 0-1 matrices with prescribed 
row snms, Ars. Comb. 19A, 15-20. 

[14] McKay, B.D. and Wormald, N.C.(1990): Uniform generation of random 
regnlar graphs of moderate degree, J. Algorithms 11, 52-67. 

[15] Liljeros, F., Edling, C., Amaral, L., Stanley, H. and Aberg, Y. (2001): The 
web of human sexnal contacts. Nature 411, 907-908. 

[16] Molloy, M. and Reed, B. (1995): A critical point for random graphs with a 
given degree sequence, Rand. Struct. Alg. 6, 161-179. 

[17] Molloy, M. and Reed, B. (1998): The size of the giant component of a 
random graphs with a given degree seqnence. Comb. Prob. Comp. 7 , 295- 
305. 

[18] Newman (2003): The strncture and fnnction of complex networks, SIAM 
Rev. 45, 167-256. 

[19] Newman, M., Strogatz, S. and Watts, D. (2001): Random graphs with arbi¬ 
trary degree distributions and their applications, Phys. Rev. E 64, 026118. 

[20] Soderberg, B. (2002): A general formalism for inhomogeneous random 
graphs. Phys. Rev. E 66, 066121. 

[21] Wormald, N.C. (1978): Some problems in the ennmeration of labelled 
graphs. Doctoral thesis, Newcastle University. 


19 



Appendix 


Proof of Theorem I2.lt First note that, since 0 < /n < 1, by 
dominated convergence, the result in (b) implies that —)• pj. 

But ~ nPj^\ and hence (a) follows from (b). 


To show (b), write for the number of vertices that has degree j before 
edges are erased to make the graph simple. By the law of large numbers 
/n —>■ Pj as n —>■ oo and hence we are done if we can show that 

^ 0 in probability as n —>■ oo. Let be the number 

of vertices where at least one stub is removed in the erasing procedure and 
note that ivj"^ — Hence, by Markovs inequality, it suffices 

to show that E[M^"')]/n —>■ 0. To do this, for i = 1,..., n, let Ei be the 
number of stubs attached to Vi that are rubbed out in the erasing procedure 
and define 


r 1 ifE, >1; 

\ 0 if E, = 0. 


Also, let Di be the degree of vertex Vi before loops and multiple edges have 
been erased and write for the probability law of the erased configura¬ 
tion model on n vertices (that is, averaged out also over the original degrees 
{Di}). Since and are equally distributed, we 

have 



n 




-Ee 

n 




(n) 


E 

P(")(P1 > 1). 


The work lies in proving that P’^")(Pi >1)—J-Oasn—>' 00 ,or equivalently, 
that = 0) —>■ 1. Clearly this follows if we show that 


P^^\Ei = 0\Di = j) —>• 1 for all j as n —>■ oo. (8) 

To prove ([5]), write Aj for the event that a given stub belonging to a vertex 
with j stubs in total avoids being removed in the erasing procedure. Below 
we show that 


P^^\Aj) —>■ 1 for all j as n —>■ oo. (9) 

This establishes (|S]): After having merged one of the j stubs of the ver¬ 
tex vi to a stub belonging to some other vertex, saving it from being 
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erased, the probability that a fixed one of the other j — 1 stubs are erased 
equals since the fact that one stub from the other vertices is 

no longer available for merging is asymptotically negligible. This can then 
be repeated until there are no remaining stubs of vi and ([5]) follows by 
noting that P^^\Ei = 0\Di =0) = 1. 

For the proof to be complete it remains to prove ([9]). To do this, first 
remember that a stub can be erased for two reasons: because it forms a 
loop and because it is part of a multiple edge. For the sake of completeness 
we also include the case when a randomly selected stub is removed if the 
total number of stubs is odd. Now, consider a fixed stub belonging to a 
vertex v with j stubs in all. Write and for the events that the 

stub is not part of a loop and a multiple edge respectively and let A°^'^ 
be the event that the stub is not removed as the randomly selected “odd” 
stub. To estimate the probabilities of these events, we condition on that 

(n) 

the total number of stubs equals m and write Pm for the corresponding 
conditioned probability measure. If m is odd, the probability that the stub 
is removed as the “odd” stub is 1/m and, if m is even, the probability is 
0. Hence 


P^'i {Af‘^) > 1 - - ( 10 ) 

For the stub to form a loop it has to be joined to one of the other / — 1 
stubs of the vertex v and, since clearly the stub is matched to each one of 
the other m — 1 stubs in the graph with the same probability l/(m — 1), 
this happens with probability (j — l)/(m — 1), that is. 



To compute the probability that the stub is not part of a multiple edge, 
assume that it does not make up a loop and condition on the degree k of 
the vertex v' of the stub to which it is joined. Also, number the remaining 
j — 1 stubs of the vertex v in some arbitrary way from 1 to / — 1 and let 
Bj be the event that there are no loops among these stubs. Trivially, 

p{n) ^ p{n) ^j^mult p 

For the event fl Bj to happen, none of the remaining j — 1 stubs 

of the vertex v can connect to another stub of u or to a stub originating 
from v'. Considering the stubs 1,..., / — 1 one in a turn, we see that the 
probability that stub number 1 avoids being connected to a stub of v or v' is 
{m—j — k)/(m — S) (the denominator comes from that two stubs are already 
used in the fixed connection between v and v' and the stub cannot join to 
itself). Then, given that stub number 1 is connected to some other vertex. 
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the probability that stub number 2 is so as well is (m — j — fc — l)/(m — 5), 
and so on. Furthermore, it is only possible for n Bj to happen if 

j — i<rn—k — j, since otherwise loops among the j — 1 stubs on vertex 
V or multiple edges between v and v' cannot be avoided. Hence, writing 
fc fo'' the probability measure conditioned on m and k, we have 


p!ni (Ar'^nB,) 

implying that 


m — k—j m — k—j — 1 
m—S m —5 


0 


m — k—2j-\-2 


iij — l<m — k — j 
otherwise. 




m — k — 2j + 2 
TO — 3 


i-i 


( 12 ) 


where r+ = max{r,0}. Combining (flUl) . (ITT]) and (HU and using Boole’s 
inequality, it follows that 


P^niiAj) = 

^ / TO-fc-2j y~^ 1 j-i 

“ \ TO y_|_ TO TO— 1 

Removing the conditioning on to and k and denoting the corresponding 
random variable and iT„ respectively, we get 


(Aj) > E 


'/Ln-Kn-2jY 

- E 

■ 1 ■ 

- E 

’ j - 1 ’ 

[v Ln \ 




[Ln-lj 


(13) 


To complete the proof, we use dominated convergence to show that the 
right hand side of (USD converges to 1 as n —>■ oo, establishing ®- Recall 
that Ln is the total number of stubs in the configuration and Kn is the 
number of stubs connected to the vertex of a randomly selected stub. The 
total number of stubs is a sum of n i.i.d. random variables {Di} with 
distribution F and mean ^p, which is finite by assumption. Hence, the 
law of large numbers implies that Lnjn —>■ /i^ almost surely as n —> oo. 
The conditional distribution of iF„ given is specified by P{Kn = 

i) = /'^^rNr^\ where is the number of Df.s that equal i. 

Since /n —?> pi as n —)> oo and ^p < oo, it follows that converges 
in distribution to a proper random variable K with distribution P{K = 
i) = ipi/ = ipi/pLp and hence Ri„/n —?> 0 almost surely as n —>■ oo. 

Combining these two observations, we get that 


/ Ln - Kn - 2j 
V Ln 


J-l 

^ 1 

+ 


a.s. as n —>■ oo. 
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Furthermore, since j and are both strictly positive, we have 


0 < 



2j 


i-i 

< 1 - 

+ 


By dominated convergence, it follows that the first term on the right hand 
side of (USD converges to 1, and it is easily seen that the other two terms 
converge to 0. Hence the proof is complete. □ 

Proof of Theorem 12.21 Fix a G (0,1) and let the degrees be distributed 
according to Fn,a- Clearly, if F has finite mean, the claim follows im¬ 
mediately from Theorem 12.11 so assume that F has infinite mean. The 
proof of Theorem 12.11 then remains valid all the way to the estimate (TT^ . 
We need to see that the right hand side tends to 1 for all j as n —>■ oo. 
To this end, let fl be the expectation of F conditioned on being smaller 
than, say, the smallest possible value in the support plus 10. Then clearly 
P{Ln > fm) —^ 1 as n —>■ oo, and, trivially. 


E 


Fn Fji 2j 


;\ J-l' 


> E 


Fn Fji 2j 


1-1 


Fn > ^n 


P{Fn > fin). 


On the event {L„ > fin}, we have Kn/Fn < n°‘/fln —>■ 0. Hence, by 
bounded convergence, the right hand side above tends to 1 as n —>■ oo and, 
since the other two terms in the estimate (fT^ clearly converge to 0, we are 
done. □ 
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